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Abstract 

Consider an infinite graph with nodes initially labeled by indepen- 
p ^ dent Bernoulli random variables of parameter p. We address the density 

• classification problem, that is, wc want to design a (probabilistic or dctcr- 

ministic) cellular automaton or a finite-range interacting particle system 
that evolves on this graph and decides whether p is smaller or larger than 
1/2. Precisely, the trajectories should converge to the uniform configu- 
I I ration with only O's if p < 1/2, and only I's if p > 1/2. We present 

solutions to that problem on Z'*, for any d> 2, and on the regular infinite 
^-H trees. For Z, we propose some candidates that we back up with numerical 

^ simulations. 

Keywords. Cellular automata, interacting particle systems, density 
classification, percolation. 

^ 1 Introduction 

Consider a finite or a countably infinite set of cells, which are spatially arranged 
' . l according to a group structure G. We are interested in the density classification 

^ problem, which consists of deciding in a decentralised way, if an initial configu- 

k>( ration on G contains more O's or more I's. More precisely, the goal is to design 

"j,^ a deterministic or probabilistic dynamical system that evolves in the configu- 

ration space {0, l}*^ with a local and homogeneous updating rule and whose 
trajectories converge to 0*^ or to 1*^ if the initial configuration contains more 
O's or more I's, respectively. To attack the problem, two natural instantiations 
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of dynamical systems are considered, one with synchronous updates of the cells, 
and one with asynchronous updates. In the first case, time is discrete, all cells 
are updated at each time step, and the model is known as a Probabilistic Cellu- 
lar Automaton (PCA) pT. A Cellular Automaton (CA) is a PCA in which the 
updating rule is deterministic. In the second case, time is continuous, cells are 
updated at random instants, at most one cell is updated at any given time, and 
the model is known as a (finite range) Interacting Particle System (IPS) |16j . 

The general spirit of the problem is that of distributed computing: gather- 
ing a global information by exchanging only local information. The challenge 
is two-fold: first, it is impossible to centralise the information (cells are indis- 
tinguishable); second, it is impossible to use classical counting techniques (cells 
contain only a binary information). 

The density classification problem was originally introduced for rings of fi- 
nite size (G = Z/nZ) and for synchronous models [T7|- After experimentally 
observing that finding good rules to perform this task was difficult, it was shown 
that perfect classification with CA is impossible, that is, there exists no given 
CA that solves the density classification problem for all values of n [Tl|. This 
result however did not stop the quest for the best - although imperfect - mod- 
els as nothing was known about how well CA could perform. The use of PCA 
opened a new path [SI [T5] and it was shown that there exist PCA that can solve 
the problem with an arbitrary precision In the present paper. Prop, [l] we 
complement the results from [T31 Sj by showing that there exists no PCA that 
solves the density classification problem for all values of n. 

The challenge is now to extend the research to infinite groups (whose Cayley 
graphs are lattices or regular trees). First, we need to specify the meaning of 
"having more O's or more I's" in this context. Consider a random configuration 
on {0, 1}*^ obtained by assigning independently to each cell a value 1 with 
probability p and a value with probability 1— p. We say that a model "classifies 
the density" if the trajectories converge weakly to 1*^ for p > 1/2, and to 
0*^ for p < 1/2. A couple of conjectures and negative results exist in the 
literature. Density classification on Z** is considered in [2] under the name of 
"bifurcation" . The authors study variants of the famous voter model IPS [TBI 
Ch. V] and they propose two instances that are conjectured to bifurcate. The 
density classification question has also been addressed for the Glauber dynamic 
associated to the Ising model at temperature 0, both for lattices and for trees [51 
[TTl [T^ . The Glauber dynamic defines an IPS or PCA having 0*^ and 1*^ as 
invariant measures. Depending on the cases, there is either a proof that the 
Glauber dynamic does not classify the density, or a conjecture that it docs with 
a proof only for densities sufficiently close to or 1. 

The density classification problem has been approached with different per- 
spectives on finite and infinite groups, as emphasised by the results collected 
above. For finite groups, the problem is studied per se, as a benchmark for un- 
derstanding the power and limitations of PCA as a computational model. The 
community involved is rather on the computer science side. For infinite groups, 
the goal is to understand the dynamics of specific models that are relevant 
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in statistical mechanics. The community involved is rather on the theoretical 
physics and probability theory side. 

The aim of the present paper is to investigate how to generalise the finite 
group approach to the infinite group case. We want to build models of PCA 
and IPS, as simple as possible, that correct random noise in the initial config- 
uration, even if the density of errors is close to 1/2. We consider the groups 
Z'^, whose Cayley graphs are lattices (Section^), and the free groups, whose 
Cayley graphs are infinite regular trees (Section 4]). In all cases, except for Z, 
we obtain both PCA and IPS models that classify the density. To the best of 
our knowledge, they constitute the first known such examples. The case of Z 
is more complicated and could be linked to the so-called positive rates conjec- 
ture [5] . We provide some potential candidates for density classification together 
with simulation experiments (Section [s]). 



2 Defining tlie density classification problem 

Let (G, •) be a finite or countable set of cells equipped with a group structure. 
Set A = {0, 1}, the alphabet, and X — A'^ , the set of configurations. For x £ X 
and u e {0, 1}, denote by |a:|„ the number of occurences of u in x. 



2.1 PCA and IPS 

Given a finite set Af a G, a transition function of neighbourhood is a function 
/ : A^ — > A. The cellular automaton ( CA ) F oi transition function / is the 
application F : X ^ X defined by: 

VxeA,V5ieG, F{x)g ^ f{{xg.^)^<zj~^). 

When the group G is Z'' or Z„ = Z/nZ, we denote as usual the law of G by the 
sign -|-, so that the definition can be written: Vx € X,Vfc e Z'^ (resp. Z„),i^(x)fc = 
f{{xk+v)veM)- 

Probabilistic cellular automata (PCA) are an extension of classical CA: the 
transition function is now a function ip : A^ — M.{A), where M.(A) denotes 
the set of probability measures on A. At each time step, the cells are updated 
synchronously and independently, according to a distribution depending on a 
finite neighbourhood [3]. This defines an application F : M{X) -> M{X). The 
image of a measure fi is denoted by fiF. 

The analog of PCA in continuous time are (finite-range) interacting particle 
systems (IPS) [1^. IPS are characterised by a finite neighbourhood Af C G, 
and a transition function / : A-^ — > A (or ip : A^ M.{A)). We attach 
random and independent clocks to the cells of G. For a given cell, the instants 
of ]R+ at which the clock rings form a Poisson process of parameter 1. Let a;* 
be the configuration at time t > of the process. If the clock at cell g rings 
at instant t, the state of the cell g is updated into f{{Xg.y)vGAf) (or according 
to the probability measure ip{{x*g.^)^^^)). This defines a transition semigroup 
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F = (F*)tgR_|_, with F* : M{X) — ^ M{X). If the initial measure is fi, the 
distribution of the process at time t is given by fj,F^ . 

In a PCA, all cells are updated at each time step, in a "synchronous" way. 
On the other hand, for an IPS, the updating is "asynchronous". Indeed, the 
probability of having two clocks ringing at the same instant is 0. 

Observe that PCA are discrete-time Markov chains, while IPS are continuous- 
time Markov processes. A measure is said to be an invariant mesasure of a 
process F, resp. (i^t)t, if fiF — fj,, resp. fiFt = for all t e M+. 

2.2 The density classification problem on Z„ 

The density classification problem was originally stated as follows: find a finite 
neighbourhood A/" C Z and a transition function / : A such that for 

any integer n > 1 and any configuration x G A^", when applying the CA F 
of transition function / to x, the sequence of iterates {F''{x))k>o reaches the 
fixed point = 0" if |x|o > and the fixed point 1 = 1" if \x\i > \x\o. Land 
and Belew FIT have proved that there exists no CA that perfectly performs this 
density classification task for all values of n. We now prove that this negative 
result can be extended to the PCA. It provides at the same time a new proof 
for CA as a particular case. 

Denote by 5x the probability measure that corresponds to a Dirac distribu- 
tion centred on x. 

Proposition 1. There exists no PCA or IPS that solves perfectly the density 
classification problem on Z„, that is, for any integer n > 1, and for any configu- 
ration X G A^" , {6xF^)t>Q converges to So if\x\Q > n/2 and to Si if\x\i > n/2. 

Proof. We carry out the proof for PCA. For IPS, the argument is similar and 
even simpler. Let us assume that F is a PCA that solves perfectly the density 
classification problem on Z„. Let Af be the neighbourhood of F, and let £ be 
such that Af C {-£,£)■ We will prove that for any x £ A^" (with n>2£), the 
number of occurrences of 1 after application of F to x is almost surely equal to 
|a;|i. Let us assume that it is not the case. Then, we have: 



Assume for instance that \y\i > \x\i (the case \y\i < \x\i is treated similarly). 
We first assume that |a:|i — a > n/2. For some integers k > 2,m > 2£, let us 
consider the configuration z = x'^O™ G y\^kn+m _ have \z\i — ka. Let be 
the suffix of length n — £ oi y, and let j/p be the prefix of length n — £ oi y. By 



3u, V, u', v' e A^, S,F{uysy''^^ypVu'0"'-'^'^v') > . 

Set w = uy,y''-'^ypvu'0"'~'^^v'. We have |-u;|i > k\y\i - 2£ > k{a + 1) - 2£. For 
m big enough, if we set k to be the largest integer such that k{a — n/2) < to/2. 



3x,yGA^", \x\i ^ \y\i 



SxF{y) > . 



(1) 



applying ([T]), it follows that: 



we have: 



\z\i — ka < 



kn + m 



\w\i > k{a+l) -2£> 



kn + m 



2 



2 
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So. with a positive probability, wc can reach a configuration with more ones than 
zeros starting from a configuration with more zeros than ones. Since F classifies 
the density with probability 1, the new configuration can be considered as an 
initial condition that needs to be classified and will thus almost surely evolve to 
the fixed point 1, that is, a bad classification will occur, which contradicts our 
hypothesis. 

The case = a < n/2 is analogous, except that wc now consider config- 
urations z of the form x*^!™ G j[^kn.+m a,nd choose the integers k,m such that 
ka + m< {kn + m) /2 and k{a + l)+m-2l> (kn + m) /2. 

We have proved that for any x G .4^" (with n > £), the number of occur- 
rences of ones after application of F to x is almost surely equal to \x\i. This is 
in contradiction with the fact that F classifies the density. □ 

This proposition can be extended to larger dimensions: for any d> 1, there 
is no PCA or IPS that classifies perfectly the density on all the groups of the 
form Z„j X ... X . 

2.3 The density classification problem on infinite groups 

Let us define formally the density classification problem on infinite groups. 

We denote by /ip the Bernoulli mcasm'c of parameter p, that is, the product 
measure of density p on X = . A realisation of /Xp is obtained by assigning 
independently to each element of G a label 1 with probability p and a label 
with probability 1 — p. We denote respectively by and 1 the two uniform 
configurations 0*^ and 1*^ and by 5x the probability measure that corresponds 
to a Dirac distribution centred on x. 

The density classification problem is to find a PCA or an IPS F, such that: 



The notation — stands for the weak convergence of measures. In our case, 
the interpretation is that for any finite subset K C G, the probability that at 
time t, all the cells of K are labelled by (resp. by 1) tends to 1 if p < 1/2 
(resp. if p > 1/2). Or equivalently, that for any single cell, the probability that 
it is labelled by (resp. by 1) tends to 1 if p < 1/2 (resp. if p > 1/2). 

2.4 From subgroups to groups 

Next proposition has the following consequence: given a process that classifies 
the density on Z^, we can design a new one that classifies on Z'' for d > 2. The 
idea is to divide Z'^ into Z^-layers and to apply the original process indepen- 
dently on each layer. 

Proposition 2. Let H be a subgroup of G, and let Fh be a process (PCA or 
IPS) of neighbourhood J\f and transition function f that classifies the density 
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on ■ We denote by Fq the process on A'^ having the same neighbourhood J\f 
and the same transition function f . Then, Fq classifies the density on A'^ . 

Proof. Since H is a subgroup, the group G is partitioned into a union of classes 
giH, g2H, . . . We have J\f C H, so that if an element g S G is in some class giH, 
then for any v £ Af, the element g ■ v is also in giH. Since Fjj classifies the 
density, on each class giH, the process Fq satisfies Thus for any cell of G, 
the probability that it is labelled by (resp. by 1) tends to 1 if p < 1/2 (resp. 
ifp>l/2). □ 

3 Classifying the density on Z^: Toom's rule 

To classify the density on Z^, a natural idea is to apply the majority rule on a 
cell and its four direct neighbours. Unfortunately, this does not work, neither 
in the CA nor in the IPS version. Indeed, an elementary square of four cells in 
state 1 on a background of O's is a fixed point for the process. For p £ (0, 1), 
monochromatic elementary squares of both colors appear almost surely in the 
initial configuration which makes the convergence to or 1 impossible. 

Another idea is to apply the majority rule on the four nearest neighbours 
(excluding the cell itself) and to choose uniformly the new state of the cell 
in case of equality. In the IPS setting, this process is known as the Glauber 
dynamics associated to the Ising model. It has been conjectured to classify the 
density, but the result has been proved only for values of p that are sufficiently 
close to or 1 0. 

To overcome the difficulty, we consider the majority CA but on the asym- 
metric neighbourhood Af — {(0, 0), (0, 1), (1, 0)}. We prove that this CA, known 
as Toom's rule [3[7j, classifies the density on 7?. Our proof relies on the prop- 
erties of the percolation clusters on the triangular lattice 10;. We then define 
an IPS inspired by this local rule and prove with the same techniques that it 
also classifies the density. 

3.1 A cellular automaton that classifies the density 

Let us denote by maj : A'^ — > A, the majority function, so that m&j{x,y, z) = 
0iix + y + z<2 and lifa; + y + z>2. 

Theorem 1. The cellular automaton T : A^ — > A^ defined by: 

T{x)i^j = maj(xij,a:jj+i,a;i+ij) 
for any x £ A^ Ahj) ^ classifies the density. 

Proof. By symmetry, it is sufficient to prove that if p > 1/2, then (/ipT")„>o 
converges weakly to ^i. 

Let us consider the triangular lattice of sites (vertices) 7? and bonds (edges) 
{{(i, j), (^, J + 1)}, {(^, j), (^ + 1, j)}, {(^ + 1, j), (i, J + 1)}, (i, j) £ I?}. We recall 
that a 0-cluster is a subset of connected sites labelled by which is maximal for 
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inclusion. The site percolation threshold on the triangular lattice is equal to 1 /2 
so that, for p > 1/2, there exists almost surely no infinite 0-cluster [TU]. Thus, 
if So denotes the set of sites labelled by 0, the set consists almost surely of 
a countable union Sq = U^gN'S'/c of finite 0-clusters. Moreover, the size of the 
0-clusters decays exponentially: there exist some constants k and 7 such that 
the probability for a given site to be part of a 0-cluster of size larger than n is 
smaller than Ke~'*^, see [TU]. 

Let us describe how the 0-clusters are transformed by the action of the CA. 
For S C 7?, let I5 be the configuration defined by {ls)x = 1 if x £ 5 and 
(Ig)^ = otherwise. Let TiS) be the subset S" of such that T(ls) = Is'- 
By a simple symmetry argument, this last equality is equivalent to T{lz,'^\s) = 
lz2\5/. We observe the following. 

• The rule does not break up or connect different 0-clusters (proved by 
Gacs [3 Fact 3.1]). More precisely, if S consists of the 0-clusters {Sk)k, 
then the components of T{S) are the nonempty sets among {T{Sk))k- 

• Any finite 0-cluster disappears in finite time: if 5 is a finite and connected 
subset of Z^, then there exists an integer n > 1 such that T^{S) = 0. 
This is the eroder property [3]. 

• Let us consider a 0-cluster and a rectangle in which it is contained. Then 
the 0-cluster always remains within this rectangle. More precisely, if R is 
a rectangle set, that is, a set of the form {{x,y) G Z^ | oi < a; < a2, 61 < 
y < ^2}, and if 5 C -R, then for all n > 1, T"{S) C R (proof by induction). 

Let us now consider all the 0-clusters for which the minimal enveloping 
rectangle contains the origin (0, 0). By the exponential decay of the size of the 
clusters, one can prove that the number of such 0-clusters is almost surely finite. 
Indeed, the probability that the point of coordinates (m, n) is a part of such a 
cluster is smaller than the probability for this point to belong to a 0-cluster of 
size larger than max(|m|, \n\). And since 

^g-7max(|mM„|) < 4k ^ (we"^™ + ^ C""^") < OO, 
(m,n)eZ2 mSN n>m 

we can apply the Borel-Cantelli lemma to obtain the result. Let Tq be the 
maximum of the time needed to erase these 0-clusters. The random variable 
Tq is almost surely finite, and after Tq time steps, the site (0,0) will always be 
labelled by a 1. As the argument can be generalised to any site, it ends the 
proof. □ 

We point out that Toom's CA classifies the density despite having many 
different invariant measures. For example: 

• Any configuration x that can be decomposed into monochromatic North- 
East paths (that is, Xij — Xij+i or Xij — Xi+ij for any is a fixed 
point and Sx is an invariant measure. 
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• Let y be the checkerboard configuration defined by yi,j = if i + j is 
even and yij — 1 otherwise, and let z be defined by Zi j — 1 — yi j. Since 
we have T{y) — z and T{z) = y, the two configurations y and z form a 
periodic orbit and {Sy + Sz)/'2 is an invariant measure. 

3.2 An interacting particle system that classifies the den- 
sity 

We now define an IPS for which we use the same steps as above to prove that 
it classifies the density. 

Note that the exact IPS analog of Toom's rule might classify the density 
but the above proof does not carry over since, in some cases, different 0-clusters 
may merge. To overcome the difficulty, we introduce a different IPS with a new 
neighbourhood of size 7: the cell itself and the 6 cells that are connected to it 
in the triangular lattice defined in the previous section. 

For a G A, set a — 1 — a. 

Theorem 2. Let us consider the following IPS: for a configuration x g , 
we update the state of the cell (i,j) by applying the majority rule on the North- 
East- Centre neighbourhood, except in the following cases (for which we keep the 
state unchanged) : 

1. = Zi-i j+i = Xi+ij^i = = Xi+ij and (xi,j^i = Xij or 

2. Xi^j J -{-1 — 1 + 1 — -^iH-ljj — 1 and Xi—\ j — Xi^j , 

3- -^{.j — •^i—ijj — 1 — — — and x^^j—i — xi j. 

This IPS classifies the density. 

The three cases for which we always keep the state unchanged are illustrated 
below for the case where xij = 1 (central cell). In the first case, we allow to flip 
the central cell if and only if the two cells marked by a dashed circle are also 
labelled by 1. Otherwise, the updating could connect two different 0-clusters 
and break up the 1-cluster to which the cell {i,i) belongs to. The second and 
third cases are analogous. 
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The proof is similar to the one of Theorem [T] but involves some additional 
technical points. 

Proof. We assume as before that p > 1/2. Like the CA of the previous section, 
the new process that we have defined has the property not to break up or connect 
different clusters. Furthermore, if we consider a 0-cluster and the smallest rect- 
angle in which it is contained, we can check again that the 0-cluster will never go 
beyond this rectangle. As before, we only need to prove that any finite 0-cluster 
disappears almost surely in finite time to conclude the proof. We consider a 
realisation of the trajectory of the IPS with initial density /ip. We associate to 
any finite 0-cluster C C the point v{C) = max{(i, j) S C}, where the order 
is the lexicographic order on the coordinates (we set u(0) = (—00, —00)). The 
point v{C) is thus the upmost point of C among its rightmost points. Let us 
consider at time some finite 0-cluster Cq. We denote by Ct the state of this 
cluster at time t. 

Claim. The value v{Ct) is nonincreasing. Moreover, if t > is such that 
Ct 7^ 0, then there exists almost surely a time t' > t such that v{Ct') < v{Ct). 

Let us prove the claim. Let us denote by a; € a configuration attained at 
some time i, and let = v{Ct). By definition of u(Ct), if a cell of coordinate 
{i -f 1,/) is connected to a cell of Ct, then Xi+i.j' = 1. Either we have also 
Xi+ij'+i — 1 and the cell (i -I- will not flip. Or = 0, but in 

this case, since [i + 1,/ + 1) does not belong to Ct, Xi,j'+i — 1 and the cell 
of Ct to which is connected {i + 1,/) is necessarily {i,j'). So, Xij' — and 
= 1, once again by definition of v{Ct). Depending on the value of 
Xi+2.j'-i, either rule 1 or rule 2 forbids the cell {i + to flip. In the same 
way, we can prove that if a cell of coordinate {i,j'),j' > j is connected to Ct, 
then it is not allowed to flip. This proves that v{Ct) is nonincreasing. In order 
to prove the second part of the claim, we need to show that the cell {i,j) will 
almost surely be flipped in finite time. By definition of {i,j) = v{Ct), we know 
that Xij^i = Xi^i,j = Xi+ij-i = 1. The cell {i,j) will thus be allowed to 
flip, except if x^^ij+i — Xi_j^i = and x^^i j = 1. But in that case, the cell 
{i — will end up flipping, except if Xi^i,j-i = Xi^2.j+i = l,a:^i-2,j — 0, and 
so on. Let W„ — {{i — n,j), {i — 1 — n,j + 1), (i — n,j — 1)}. If for each n, 
the cells of Wn are in the state (n mod 2), then none of the cell {i — n,j) is 
allowed to flip (see Figure [2ja). But recall now that the initial measure is /ip. 
There exists almost surely an integer n > such that the initial state of the cell 
(i — n,j) is not {n mod 2). Let m{t) be the smallest integer n whose value at 
time t is not n mod 2. Then, one can easily check that m{t) is non-increasing, 
and that it reaches in finite time. Thus, the cell (i, j) ends up flipping and we 
have proved the claim. 

The example of Figure [2]b illustrates how the proof works. Here, no cell of 
the cluster Ct is allowed to flip, but since the cells on the right and on the top 
of v{Ct) cannot flip either, v{Ct) does not increase. The cell at the left of v{Ct) 
will end up flipping, and v{Ct) will then be allowed to flip. 

Since we know that a 0-cluster cannot go beyond its enveloping rectangle, a 
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(a) (b) 
Figure 2: Illustration of the proof of Theorem [2] 



direct consequence of the claim is that any 0-cluster disappears in finite time. 
This allows us to conclude the proof in the same way as for the majority cellular 
automaton. □ 



4 Classifying the density on regular trees 

Consider the finitely presented group Tn — (ai, . . . ,a„ | of — 1). The Cayley 
graph of Tn is the infinite n-regular tree. For n — 2k, we also consider the free 
group with k generators, that is, Tjfc = ('^ii ■ • ■ i Ofe I •)■ The groups and 
are not isomorphic, but they have the same Cayley graph. 

4.1 Shortcomings of the nearest neighbour majority rules 

For odd values of n, a natural candidate for classifying the density is to apply 
the majority rule on the n neighbours of a cell. But it is proved that neither 
the CA (see [12] for n = 3,5, and 7) nor the IPS (see [H] for n — 3) classify the 
density. 

For n = 4, a natural candidate would be to apply the majority on the four 
neighbours and the cell itself. We now prove that it does not work either. 

Proposition 3. Consider the group T'^ — {a,h \ ■). Consider the majority CA 
or IPS with neighbourhood M = {1, a, fe, a^^, 6^^}. For p G (1/3,2/3), the 
trajectories do not converge weakly to a uniform configuration. 

Proof. If p € (1/3,2/3), then we claim that at time 0, there are almost surely 
infinite chains of zeros and infinite chains of ones that are fixed. Let us choose 
some cell labelled by 1. Consider the (finite or infinite) subtree of I's originating 
from this cell viewed as the root. If we forget the root, the random tree is 
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exactly a Galton- Watson process. Indeed, the expected number of children of a 
node is 3p and since 3p > 1, this Galton- Watson process survives with positive 
probability. Consequently, there exists almost surely an infinite chain of ones at 
time somewhere in the tree. In the same way, since 3(1 — p) > 0, there exists 
almost surely an infinite chain of zeros. □ 

As for Z^, we get round the difficulty by keeping the majority rule but 
choosing a non-symmetrical neighbourhood. 

4.2 A rule that classifies the density on 

In this section, we consider the free group = (a, 6|-), see Fig. [3](a). 
Theorem 3. The cellular automaton F : A^'>- — > A^^i- defined by: 

F{x)g ^■!ha.^{Xga,Xgab,Xgab-i) 

for any x G A^^^^g G T[, classifies the density. 

Proof. Wc consider a realisation of the trajectory of the CA with initial distri- 
bution /Lip. Let us denote by Xg the random variable describing the state of the 
cell g at time n. Since the process is homogeneous, it is sufficient to prove that 
X" converges almost surely to if p < 1/2 and to 1 if p > 1/2. Let us denote 
by h : [0, 1] [0, 1] the function that maps a given p G [0, 1] to the probability 
h(p) that maj (X, Y, Z) = 1 when X, F, Z are three independent Bernoulli ran- 
dom variables of parameter p. An easy computation provides h{p) = ip^ — 2p^, 
and one can check that the sequence (/i"(p))n>o converges to if p < 1/2 and 
tolifp>l/2. 

We prove by induction on n G N that for any fc G N, the family £k{n) = 
{Ar"^„^ I ui,U2,...,Uk G {a,ab,ab~^}} consists of independent Bernoulli 
random variables of parameter h'"{p). By definition of fj,p, the property is true 
at time n = 0. Let us assume that it is true at some time n > 0, and let us fix 
some A; > 0. Two different elements of Skin, + 1) can be written as the majority 
on two disjoint triples of £k+i{n). The fact that the triples are disjoint is a 
consequence of the fact that {a, ab, ab^^} is a code: a given word g £ G written 
with the elementary patterns a,ab,ab~^ can be decomposed in only one way 
as a product of such patterns. By hypothesis, the family £k+i{n) is made of 
i.i.d. Bernoulli variables of parameter h^{p), so the variables of £k{n -f 1) are 
independent Bernoulli random variables of parameter Consequently, 
the process F classifies the density on □ 

Let us mention that from time n > 1, the field {Xg)g^c is not i.i.d. For 
example, X| and X^j,_i^_i are not independent since both of them depend on 

On T^f, = {ai, . . . ,ak\-), one can either apply Prop. [2] to obtain a cellular 
automaton that classifies the density, or define a new CA by the following for- 
mula: F{x)g ^ ma.2{xgai,Xgaia2,Xg^^^-i, . . . ,Xgaiak,Xg^^^-i) and check that it 
is also classifies the density. 
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(a) (b) 



Figure 3: The ceUular automata described by Theorem |3j and Theorem |4j 

It is also possible to adapt the above proof to show that the IPS with the 
same local rule also classifies the density. 

4.3 A rule that classifies the density on T3 

We now consider the group T3 = (a, b, c \ = b'^ = cP = I) . 
Theorem 4. The cellular automaton F : A^'-^ — > A^'"^ defined by: 

^{^^g — {^gab^ -^gac^ -^gacbc) 

for any x £ A"'"^,g S T3, classifies the density. 

Proof. The proof is analogous to the previous case. We prove by induction on 
n e N that for any k £ N, that the family £k{n) — \ Ui, U2, ■ ■ . ,uic G 

{ab, ac, acbc}} consists of independent Bernoulli random variables of parameter 
h^^{p), the key point being that {ab, ac, acbc} is a code. □ 

Once again, as explained in Prop. [2] since we have a solution on T3, we 
obtain a CA that classifies the density for any Tn,n > 3, by applying exactly 
the same rule. The corresponding IPS on T!„ also classifies the density. 

5 Classifying the density on Z 

The one-dimensional case appears as much more difficult than the other cases 
and we are not aware of any solution to the density classification problem on Z. 
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However, if we slightly change the formulation of the problem, simple solutions 
do exist. We first give one such modification and then go back to the original 
problem and describe three models, two CA and one PCA, that are conjectured 
to classify the density. We also provide some preliminary analytical results as 
well as experimental confirmations of these results by using numerical simula- 
tions. 

In the examples below, the traffic cellular automaton, rule 184 according 
to Wolfram's notation, plays a central role. It is the CA with neighborhood 
Af = {—1, 0, 1} and local function traf defined by: 



x,y,z 


111 


110 


101 


100 


Oil 


010 


001 


000 


traf (a;,y,z) 


1 





1 


1 


1 












This CA can be seen as a simple model of traffic flow on a single lane: the 
cars are represented by I's moving one step to the right if and only if there are 
no cars directly in front of them. It is a density-preserving rule. 

5.1 An exact solution with weakened conditions 

On finite rings, several models have been proposed that solve relaxed variants 
of the density classification problem. We concentrate on one of these models 
introduced in jl5j. The original setting is modified since the model operates 
on an extended alphabet, and the criterium for convergence is also weakened. 
Modulo this relaxation, it solves the problem on finite rings Z„. We show the 
same result on Z. 

Proposition 4. Consider the cellular automaton F on the alphabet B — , 
with neighbourhood J\f = {—1, 0, 1}, and local function f ~ (/i, /2) defined by: 



{0 if xi = yi = 
1 ifxi^yi = l (3) 
?/2 otherwise 



The projections iJ,pF"{A^ x •) converge to Sq if p < 1/2 and to Si if p > 1/2. 

Intuitively, the CA operates on two tapes: on the first tape, it simply per- 
forms the traffic rule; on the second tape, what is recorded is the last occurrence 
of two consecutive zeros or ones in the first tape. If p < 1/2, then, on the first 
tape, there is convergence to configurations which alternate between patterns 
of types O'^ and (10)^. Consequently, on the second tape, there is convergence 
to the configuration Sq. Wc formalise the argument below. 

Proof. Let T : — ^ A^ be the traffic CA, see above. Following an idea 
of Belitsky and Ferrari [Tj, we define the recoding ip : A^ — {—1,0,1}^ by 
tlj{x)i = 1 — Xi — Xi-i. Consider (i/j o T"(a;))„>o, the recodings of the trajectory 
of the CA originating from x G {0, 1}^. There is a convenient alternative way 
to describe {tp o T"(a:))„>o. It corresponds to the trajectories in the so-called 
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Ballistic Annihilation model: 1 and —1 are interpreted as particles that we call 
respectively positive and negative particles. Negative particles move one cell to 
the left at each time step while positive particles move one cell to the right; and 
when two particles of different types meet, they annihilate. 

Consider the Ballistic Annihilation model with initial condition fipip for p > 
1/2. The density of negative particles is p^, while the density of positive particles 
is (1 — p)^. During the evolution, the density of positive particles decreases to 
0, while the density of negative particles decreases to 2p — 1. In particular, 
the negative particles that will never disappear have density 2p — 1 (see [1] for 
details). We can track back the position at time of the "eternal" negative 
particles. Let X be the (random) position of the first eternal particle on the 
right of cell 0. After time X, the column in the space-time diagram contains 
only or —1 values. This key point is illustrated in the figure below. 




We now go back to the traffic CA with initial condition distributed according 
to fip for p > 1/2 and concentrate on two consecutive columns of the space-time 
diagram. The property tells us that after some almost surely finite time, the 
columns contain only the patterns 11,01, or 10. 

For the CA defined by Eq. [3] with an initial condition distributed according 
to a measure /i satisfying ^(- x A^) — fip for p > 1/2, the above key point 
gets translated as follows: in any given column of the space-time diagram, after 
some a.s. finite time, the column contains only the letters (0,1) or (1,1). In 
particular, fipF^{A^ x •) converges weakly to 5^ if p > 1/2. □ 

5.2 Density classifier candidates on Z 

The GKL cellular automaton. The Gacs-Kurdyumov-Levin (GKL) cel- 
lular automaton is the CA with neighbourhood Af = {—3,-1,0,1,3} defined 

by 

\ma.j{xk,Xk+i,Xk+3) if Xfc = 1 
[maj {xk,Xk-i,Xk-3) II Xk = 0. 

for any x € A^, fc G Z. 

The GKL CA is known to be one of the best performing CA for the density 
classification on finite rings (see Fig. [1]). It has also been proven to have the 
eroder property: if the initial configuration contains only a finite number of ones 
(resp. zeros), then it reaches (resp. 1) in finite time, see |9j. 
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GKL, d < 1/2 



GKL, d > 1/2 




Figure 4: Two space-time diagrams of GKL (top) and Kari's PGA (bottom) for 
n = 149. Initial condition with density 70/149 (left) and 77/149 (right). 



Kari traffic cellular automaton. This GA is defined by the composition 
of the two following rules applied sequentially at each time step: (a) apply the 
traffic rule, (b) change the 1 into a in every pattern 0010 and the into a 1 
in every pattern 1011 (see Fig. |4]). 

Like GKL, Kari traffic GA has a neighbourhood of radius 3. Both GA also 
share the combined symmetry consisting in swapping and 1 and right and 
left. Kari traffic has also the eroder property and it appears to have comparable 
qualities to GKL concerning the density classification task, see [15] . Kari traffic 
GA is closely related to Kurka's modified version of GKL [T3]. 

The majority-traffic probabifistic cellular automaton. The majority- 
trafhc PGA of parameter a G (0, 1) is the PGA of neighbourhood Af = {—1, 0, 1} 
and local function: 

ip{x,y,z) = a(5maj(a;,j,,z) + (1 - Oi) hraf(x,v,z)■ 
lll words, at each time step, we choose, independently for each cell, to apply the 
majority rule with probability a and the traffic rule with probability 1 — a (see 
Fig. [5]). 

The majority-traffic PGA has been introduced by Fates |4j who has proved 
that it "classifies" the density on a finite ring with an arbitrary precision: for 
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Figure 5: Two space-time diagrams of the majority-trafRc PCA for a — 0.1 and 
n = 149. The same initial condition with density 70/149 is used. The case seen 
on the right is a rare event (evolution towards a bad classification). 

any n S N and any e > 0, there exists a value a„_£ of the parameter such that 
on Z„, the PCA converges to the right uniform configuration with probability 
greater than 1 — e. 

Conjecture 1. The GKL CA, the Kari traffic CA, and the majority-traffic 
PCA with < a < ac (for some < ac < 1/2 j classify the density. 

5.3 Invariant Measures 

Following ideas developed by Kurka [13], we can give a precise description of 
the invariant measures of these PCA. 

Proposition 5. For the majority-traffic PCA and for Kari traffic CA, the 
extremal invariant measures are So, Si, and {S(Qiy£-\-S^iQ-^z)/2. For GKL, on top 
of these three measures, there exist extremal invariant measures of density p for 
any p e [1/3,2/3]. 

Proof. Majority-trafRc PCA. Let us consider the majority-traffic PCA P 
of parameter a £ (0,1). We denote by [xq, . . . , Xn\k the cylinder set of all 
configurations y € satisfying yt+i = Xi for < i < n. Let fj, be any shift- 
invariant measure. An exhaustive search shows that if at time 1, we observe the 
cylinder [100] o then there are only eight possible cylinders of size 5 at time 0, 
that are: 

[01100]„i, [10000]_i, [100Gl]_i, [10010]_i, 
[1010G]_i, [1100G]_i, [11001]_i, [11100]_i. 

If we weight each cylinder by the probability to reach [100] o from them, we 
obtain the following expression: 

fiP[100] = a{l - a)^[01100] + (1 - a)fi[10000] + (1 - a)^[100Gl] + (1 - a)/i[10G10] 
-\- aAi[101G0] + aV[110G0] -I- a^fi[UOQl] + a{l ~ a)^[11100]. 
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Since the measure /i is supposed to be shift-invariant, wc do not need to specify 
the position of the cyhnders: we denote by ij[xo, . . . , x„] the value /i([a;o, ■ ■ • , Xn]k) 
which does not depend on fc e Z. Gathering the terms with the same coefBcient, 
we have: 

HP[100] = (1 - Q!)(At[100] - At[10011]) + a/i[10100] + a{l - a)/i[1100] + ^^[llOO] 
= (1 - a)(/i[100] - ;u[10011]) + a/iflOlOO] + a/x[1100]. 

Some more rearrangements provide: 

AtP[100] = (1 - a)(Ai[100] - /x[10011]) + a(^[100] - ^[00100]) 
= /i[100] - (1 - a)/i[10011] - aM[00100]. 

This proves that the sequence (^P"[100])„>o is non-increasing. Let us assume 
that fiP = fi. Then, ,u[10011] = ^[00100] = 0. 

Lot us consider the cyhnder [lO^OOll] for some n > 2. If we apply the 
majority rule on each cell except on the second cell from the left, then after 
n iterations, wc reach the cylinder [10011]. Since this occurs with a positive 
probability, we obtain that for any n > 0, /i[10"0011] = 0. This provides: 
H[00n] = m[00011] = Ai[000011] = . . . = /i[0"ll] for any n > 2. Consequently, 
/i[OGll] = 0. From a cylinder of the form [00(10)"11], if we choose to apply 
the majority rule on each cell, then we reach the cylinder [0011] in n steps. 
Thus, /u[00(10)"ll] = for any n > 0. It follows that fj, can be written as the 
sum fj, = fX(f + 111 of two invariant measures, where /ig charges only the subshift 
So = e 1 Vfc G Z, XkXk+i ^ 00} and in the subshift Si = {a; G | VA; G 
Z, XfeXfe+i ^ 11}. Let us assume that /u[00] = (which is the case for fio). In 
the same way that we have computed /uP[110], we can compute /uP[ll], and we 
obtain: 

fJ,P[n] = aAi[0110]+a/i[1110]+a/i[1101]+/i[1011]+M[0111]-|-At[llll] 
= an[110] + aii[1101] + nill] - ii[OOU] 

= ii[n] + aii[no] + aii[noi]. 

By hypothesis, nP = jj,, so that the last equality implies that /i[110] = 0. 

In all cases, if /x is a shift-invariant measure such that /xP = /i, then /x[00] = 
M(0), m[11] = and m[01] = m[10] = m((01)^) = m((10)^). 

Kari traffic CA. If at time 1, we observe the pattern 100 at position 0, 
then, at time 0, that is to say before the application of Kari's CA, this same 
pattern was present at position —1. Indeed, one can check that none of the 
cell of the pattern 100 can have been obtained by the transformation (b) (see 
the definition of Kari traffic CA), so that one has just to consider the possible 
history of 100 by the traffic CA. In the same way, one can prove that if at time 
1, we observe the pattern 110 at position 0, then, at time 0, this same pattern 
was present at position 1. Let /x be a shift- invariant measure such that nK = ii, 
where K denotes Kari traffic CA. A consequence of the result on the patterns 
100 and 110 that we have just described is that /uii'"'+^[110a;100] = for any 
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n > and any x G A". But since ^K'^'^^ = /i, we obtain /i[110xlOO] = for 
any word x on the alphabet A. Once again, we can write = /io + Mi where /io 
and Hi are two invariant measures defined on Eq and Ei. 

Let us consider a configuration of Eq, that is, without the pattern 00. By 
the traffic rule, each of the configuration will move one cell to the left. Then 
by rule 1, if a is at distance greater than 2 from the next on its right, it is 
erased by rule (b). The result follows. 

GKL. Any word x € A^ that is a concatenation of the patterns u = 001 and 
V — Oil is a fixed point of the GKL cellular automaton: if a;„ = 0, then either 
Xn-i = or Xn-3 = SO that F(x)n — and if a;„ = 1, then either Xn+i = 1 
or Xn+3 = 1 SO that F{x)n = 1. As a consequence, GKL has extremal invariant 
measures of density p for any p £ [1/3, 2/3]. □ 

To summarize, the majority-traffic and Kari traffic CA have a simpler set 
of invariant measures. It does not rule out GKL as a candidate for solving the 
density classification task, but rather indicates that it could be easier to prove 
the result for majority-traffic or Kari traffic CA. 

5.4 Experimental results 

Conjecture [l] was first motivated by the observation of the space-time diagrams, 
see Fig. |4] and [5] We provide some numerical results that support this conjec- 
ture. For a given ring size n, we generate an initial configuration x by assigning 
to each cell the state 1 with a probability p and the state with probability 
1 — p. Let us denote by d(x) the actual density of 1 in the configuration x. We 
let the system evolve until it reaches a fixed point or 1 and see if the fixed 
point is for d{x) < 1/2 and 1 for d{x) > 1/2. The quality Q{n) corresponds 
to the proportion of good classifications on a given ring of size n. 

Figure [g] shows the evolution of Q{n), each value of Q{n) being evaluated 
over 100 000 samples. For the three rules, the plots are in agreement with the 
hypothesis that the asymptotic value of Q{n) is 1. From a qualitative point of 
view, we observe that for all values of d the quality decreases before increasing, 
but this is only a border phenomenon for very small ring sizes. We also observe 
that when the initial density d increases from 0.45 to 0.48, the value of n needed 
to attain a given quality Q{n) increases dramatically. For d = 0.49, the change 
of derivative of the curve Q{n) becomes hardly visible. However, our belief is 
that Q{n) will approach one as the lattice size grows, no matter how close p is 
to the critical density 1/2. To see why this holds, consider the error rate err{d) 
obtained as the probability to make a bad classification when the initial configu- 
ration is equal to d. We experimentally observed that, as n grows, the function 
err(d), which is defined for values k/n with k e {0, . . . ,n}, approaches a Bell 
curve whose mean is centred on 1/2 and whose tail progressively approaches 
the 0-axis (see Fig. [t]). At the same time, for a fixed p, the probability p{k/n) 
that the initial configuration has a density of ones equal to k/n follows a bino- 
mial distribution of parameter p. We can thus calculate the global error rate 
E{n) = 1 — Q{n) with E(n) = X]fe=o s.rr(k/n)-p{kln). Intuitively, it can be seen 



18 






Figure 6: Experimental determination of the quality of classification Q{n) as a 
function of ring size n. Cells are initialised with a probability p to be in state 1. 
Each point represents an average computed out 100 000 trajectories. 
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Figure 7: Majority-traffic rule with a — 0.25: Evolution of the error rate as a 
function of the initial density when doubling the ring size. 

that as n grows to infinity, the two distributions err{k/n) and p{k/n) progres- 
sively separate as their mean value is different and their variance approaches 0. 
As a consequence, for larger values of n, the value E{n) progressively vanishes 
and the quality approaches 1. 

By contrast, there are other PCA, such as the rule which was originally 
studied by Fuks [6', and which consists in doing a copy of the right or left 
neighbour with a fixed probability p < 1/2, and the identity otherwise. This 
rules preserves the density in average at each time step for finite rings (see 
details in [3]). As a consequence, we have Q(n) = max (p, I — p) which implies 
that the increase of n does not improve the average performance of the system. 
In the infinite case, the preservation of the density is exact and can not allow the 
system to classify the density. We experimentally observed the same qualitative 
behaviour for the rule used by Schiile [TB] or for the Majority- Traffic rule for a > 
1/2. We believe that there exists a strong relationship between the asymptotic 
behaviour of the rule on finite rings and the ability to classify the density on Z. 

5.5 Link with the positive rates "conjecture" 

The difficulty of classifying the density on Z is related to the difficulty of the 
ergodicity problem on Z. By definition, a PCA or an IPS has positive rates if 
all its local probability transitions are different from and 1. In Z^, there exist 
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positive rates PCA and IPS that are non-ergodic (for instance, a "positive rates 
version" of Toom's rule It had been a long standing conjecture that all 

positive rates PCA and IPS on Z are ergodic. Gacs disproved the conjecture 
by exhibiting a complex counter-example with several invariant measures, but 
with an alphabet of cardinality 2^* instead of 2 |5]. If we knew a process that 
classifies the density on Z, it could pave the way to exhibit simple examples of 
positive rates processes that are non-ergodic. 
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